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Remarks/Arguments 

The elected claims are directed to a method for identifying a HCV RNA-dependent RNA 
polymerase inhibitor using an in vitro assay. The present application provides data 
demonstrating that NS5B does in fact produce RNA-dependent RNA polymerase activity and 
that sufficient polymerase activity can be produced in vitro to identify a polymerase inhibitor. 
The application provides motivation, and examples of techniques, for the skilled artisan to move 
beyond mere identification and characterization of polymerase activity into assaying for 
polymerase inhibitors, 

35 UXC 103 (Obviousness) 

Claims 12, 14, 16, 17 and 18 stand rejected as allegedly obvious based on either (1) 
Behrens et al. (EMBO 15(1): 12-22, January 1, 1996); (2) Al et al. (Hepatology 22(4 part 2): 
331 A, October 1995); or (3) Tomei et al. (Journal of Virology 67(7): 4017-4026, July 1993). 
There rejections are respectfully traversed. 

The prior office action notes that the rejections based on Behrens et al. and Al et al. may 
be overcome by filing a translation of the claimed priority document. Enclosed is a copy of a 
certified translation of the priority document. A copy of the translation was also filed in the 
parent application. 

The office action argues that Tomei et al. suggests that NS5B may act as a viral replicase 
based on the presence of a GDD sequence. Tomei et al. is argued to provide motivation for 
further characterizing the function and roles of NS5B encoded proteins, determining whether 
proteolytic processing affects NS5B protein product, characterizing the specific mechanism of 
RNA-dependent RNA polymerase activity, and being used to identify potential therapeutics. 

It is respectfully submitted that Tomei et al., fails to provide a reasonable expectation of 

success that NS5B can be used in an in vitro assay for identifying a HCV RNA-dependent RNA 

polymerase inhibitor. Tomei et al. speculates that NS5B may act as a viral replicase and points 

out that the HCV NS5 region is processed differently than flavivirus NS5B: 

The NS5 region of the HCV polyprotein is cleaved into two smaller 
products of 47 and 65 kDa; the processing of this region therefore differs from 
that of flavivirus NS5, which is released from the polyprotein precursor as a 
single protein of 1 10 kDa. The GDD consensus sequence characteristic of RNA- 
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dependent RNA polymerases is located in NS5b (residues 2736 to 2738), 
indicating that this protein may act as a viral replicase during HCV-specific RNA 
synthesis (17). However, NS5a could also have a function in the replication of 
the viral genome, acting as a component of the replication complex involved in 
the reaction. [Emphasis added.] 

Tomei et al., fails to provide any data measuring RNA-dependent RNA polymerase activity from 
NS5B in vitro. 

Absent data demonstrating that NS5B provides RNA-dependent RNA polymerase 
activity, the skilled artisan would have no motivation to perform an assay measuring RNA- 
dependent RNA polymerase activity from NS5B in vitro to assay for a HCV polymerase 
inhibitor. Not only is there uncertainty as to (1) whether NS5B provides RNA-dependent RNA 
polymerase, but there is also uncertainty as to (2) whether NS5B, if it encodes the polymerase, 
can be used in an in vitro assay to generate sufficient activity to identify inhibitors. 

Double Patenting 

The pending claims stand rejected for double patenting based on U.S. Patent No. 
6,383,768 ('768 patent). Claims 12, 14, 16, 17 and 18 stand rejected under obviousness type 
double patenting. Claims 13, 15 and 19 stand rejected for same invention double patenting. The 
rejections are respectfully traversed. 

With respect to obviousness-type double patenting, as noted by the Examiner, applicants 
may overcome the rejection by filing a terminal disclaimer. Because of the possibility that the 
claims may be amended in the future, applicants have not yet filed a terminal disclaimer. 

Pending claims 13, 15 and 19 were rejected for same invention double patenting based on 
claims 8, 15 and 1 1 of the '768 patent. The differences between these claims can be illustrated 
by the difference between pending claim 13 and '768 patent claim 8. Pending claims 15 and 19 
each depend on claim 13. '768 patent claims 15 and 1 1 depend from '768 patent claim 8 and 
provide similar limitations as pending claim 15 and 19. 

• Pending claim 13 differs from the '768 patent claim 8, for example, in that pending claim 
13 indicates that NS5B is the only HCV protein present during said incubation. In contrast, the 
'768 patent claim 8 characterizes the source of NS5B provided to an assay composition as "said 
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NS5B is provided to said composition from a preparation wherein said NS5B is the only HCV 
protein present . . . 

Accordingly the claims are in condition for allowance. Please charge deposit account 13- 
2755 for fees due in connection with this amendment. If any time extensions are needed for the 
timely filing of the present amendment, Applicants petition for such extensions and authorize the 
charging of deposit account 13-2755 for the appropriate fees. 



Respectfully submitted, 




Sheldon O. Heber 
Reg. No. 38,179 
Attorney for Applicants 



Merck & Co., Inc. 
P.O. Box 2000 
Rahway, NJ 07065-0907 
(732) 594-1958 
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D. TITLE 

^^METHOD FOR REPRODUCING IN VITRO THE RNA-DEPENDENT RNA POLYMERASE 
AND TERMINAL NUCLEOTIDYL TRANSFERASE ACTIVITIES ENCODED BY 
HEPATITIS C VIRUS (HCV) " 

L . ABSTRACT 

This is a method for reproducing in vitro the RNA-dependent 
RNA polymerase activity associated with hepatitis C virus. The 
method is characterized in that sequences, contained in NS5B are 
used in the reaction mixture. The terminal nucleotidyl 
transferase activity, a further property of the NS5B protein, can 
also be reproduced using this method. The method takes advantage 
of the fact that the NS5B protein, either purified to apparent 
homogeneity or present in extracts of overproducing organisms, 
can catalyze the addition of ribonucleotides to the 3' termini of 
exogenous or endogenous RNA molecules. 

The invention also relates to a composition of matter that 
comprises sequences contained in NS5B, and to the use of these 
compositions for the set up of an enzymatic test capable of 
selecting, for therapeutic purposes, compounds that inhibit the 
enzymatic activity associated with NS5B. 

Figure 1 shows plasmids used in the method to produce 
hepatitis C virus RNA-dependent RNA polymerase and terminal 



nucleotidyl* transferase in cultivated eukaryotic and prokaryotic 
cells. 




P .ETL = Promoter of the gene coding for the PCNA protein 
P PH = Promoter of the polyhedrin gene 

Amp = Gene coding for the fi-lactamase enzyme (ampicillin 

resistance) 

LacZ (fi-gal) = Gene coding for the S-galactosidase enzyme 
Col El = pBR322 replication origin 
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DESCRIPTION of the Industrial Invention entitled: 

^^METHOD FOR REPRODUCING IN VITRO THE RNA-DEPENDENT RNA POLYMERASE 
AND TERMINAL NUCLEOTIDYL TRANSFERASE ACTIVITIES ENCODED BY 
HEPATITIS C VIRUS (HCV) " 

of the Italian Institute: ISTITUTO DI RICERCHE DI BIOLOGIA 

MOLECOLARE P. ANGELETTI S.p.A. 

With headquarters in POMEZIA, ROME (ITALY) 



DESCRIPTION 

The present invention relates to the molecular biology and 
virology of the hepatitis C virus (HCV) . More specifically., this 
invention has as its object the RNA-dependent RNA polymerase 
(RdRp) and the nucleotidyl terminal transferase (TNTase) 
activities produced by HCV, methods of expression of the HCV RdRp 
and TNTase methods to test RdRp and TNTase activities in vitro 
encoded by HCV in order to identify, for therapeutic purposes, 
compounds that inhibit these enzymatic activities and therefore 
might interfere with the replication of the HCV virus. 

As is known, the hepatitis C virus (HCV) is the. main 
etiological agent of hepatitis C (HCV) and the main etiological 
agent of non-A, non-B hepatitis (NANB) . It is -estimated that HCV 
causes at least 90% of post-transfusional NANB viral hepatitis 
and 50% of sporadic NANB hepatitis. Although great progress has 
been made in the selection of blood donors and in the 
immunological characterization of blood used for transfusions, 
there is still a high number of HCV infections among those 
receiving blood transfusions (one million or more infections 
every year throughout the world) . Approximately 50%^ of HCV- 
infected individuals develop cirrhosis of the liver within a 



period that can range from 5 to 4 0 years. Furthermore, recent 
clinical studies suggest that there is a correlation between 
chronic HCV infection and the development of hepatocellular 
carcinoma. 

HCy is an envelope virus containing an RNA positive genome 
of approximately 9.4 kb. This virus is a member of the 
Flaviviridae family, the other members of which are the 
flaviviruses and the pestiviruses . The RNA genome of HCV. has 
recently been mapped.. Comparison of sequences from the HCV 
genomes isolated in various parts of the world has shown that 
these sequences can be extremely heterogeneous. The majority of 
the HCV genome is occupied by an open reading- frame (ORF) that 
can vary between 9030 and 9099. nucleotides. This ORF codes for a' 
single viral polyprotein,' the length of which can vary from 3010 
to 3033 amino acids. During the viral infection cycle, the 
polyprotein is proteolytically processed into the individual gene 
products, necessary for replication of the virus. The genes coding 
for HCV structural proteins are located at the 5' end of the ORF, 
whereas the region coding for the non- structural proteins 
occupies the rest of the ORF. 

The structural proteins consist of C (core, 21 kDa) , El 
(envelope, gp37) , and E2 (NSl, gp61) . C is a non-glycosylated 
protein of 21 kDa that probably forms the viral nucleocapsid. The 
protein El is a glycoprotein of approximately 37 kDa, believed to 
be a structural protein in the outer envelope of the virus. E2, 
another glycoprotein with a 61 kDa membrane, probably constitutes 
a second structural protein of the outer envelope of the virus. 

The non- structural region starts with NS2 (p24) , a 
hydrophobic protein of 24 kDa whose function is unknown. NS3, a 
protein of 68 kDa that follows NS2 in the polyprotein,. has two 



functional domains: a serine protease domain in the first 200 
amino- terminal amino acids, and an RNA-dependent ATPase domain at 
the carboxy terminus. The gene region corresponding to NS4 codes 
for NS4A (p6) and NS4B (p26) , two hydrophobic proteins of 6 and 
26 kDa, respectively, whose functions have not yet been 
clarified. The gene corresponding to NS5 also codes for two 
proteins, NS5A (p56) and NS5B (p65) , of 56 and 65 kDa, 
respectively. 

Various molecular biology studies indicate that the signal 
peptidase, a protease associated with the endoplasmic reticulum 
of the host cell, is responsible for proteolytic processing in 
the non- structural region, that is to say, at sites C/El, E1/E2 
and E2/NS2. A second protease activity of HCV appears to be 
responsible for the cleavage between NS2 and NS3 . This protease 
activity is contained in a region comprising both the part of NS2 
and the part of NS3 containing the serine protease domain, but 
does not use the same catalytic mechanism. The serine protease 
contained in NS3 is responsible for cleavage at the junctions 
between NS3 and NS4A, between NS4A and NS4B, between NS4B and 
NS5A, and between NS5A and NS5B. 

Similarly to other (+) -strand RNA viruses, the replication 
of HCV is thought to proceed via the initial synthesis of a (-)- 
RNA strand, which serves, in turn, as template for the production 
of progeny (+) -strand RNA molecules. An RNA-dependent RNA 
polymerase (RdRp) has been postulated to be involved in both 
these steps. An amino acid sequence present in all the RNA- 
dependent RNA polymerases can be recognized within the NS5 
region. This suggests that the NS5 region contains components of 
the viral replication machinery. Virally-encoded polymerases have 
traditionally been considered important targets for inhibition by 
antiviral compounds. In the specific case of HCV, the search for 



such substances has, however, been hindered by the lack of both a 
suitable model system of viral infections (e.g., infection of 
ceils in culture .or a facile, animal model) and a functional RdRp 
enzymatic assay. 

It has now been unexpectedly found that this important 
limitation can be overcome by adopting the method according to 
the present invention, which also gives additional advantages 
that will be evident from the following. 

The present invention has as its object, a method for 
reproducing in vitro the RNA-depehdent RNA polymerase activity of 
HCV that makes use of sequences contained in the HCV NS5B 
protein." The terminal nucleotidyl transferase activity, a further 
property of the NS5B protein, can also be reproduced using this 
method. The method takes advantage of the fact that NS5B can be 
expressed in either eucaryotic or prokaryotic heterologous 
systems: the recombinant proteins containing sequences of NS5B, 
either purified to apparent homogeneity or present in extracts of 
overproducing organisms, can catalyze the addition of 
ribonucleotides to the 3' termini of exogenous RNA molecules, 
either in a template-dependent (RdRp) or template -independent 
(TNTase) fashion. 

The invention also extends to a new composition of- matter, 
characterized in that it comprises proteins whose sequences are 
described in SEQ ID NO: 1 or sequences contained therein or 
derived therefrom. It is understood that this sequence may vary 
in different HCV isolates, since all RNA viruses show a high 
degree of variability. This new composition of matter has the 
.RdRp activity necessary to HCV in order to replicate its genome. 



The present invention also has as its object the use of this 
composition of matter in order to prepare an enzymatic assay 
capable of identifying, for therapeutic purposes, compounds that 
inhibit the enzymatic activities associated with NS5B, including 
inhibitors of the RdRp and that of the TNTase. 

Up to this point, a general description has been given of . 
the present invention. With the aid of the following examples, a 
more detailed description of specific embodiments thereof will 
now be given, in order to give a clearer understanding of its 
objects, characteristics, advantages and methods of operation. 

Figure 1 shows the plasmid constructs used for HCV cDNA transfer 
into a baculovirus expression vector. 

Figure 2 shows the plasmid used for the in vitro synthesis 
-of the D-RNA substrate of the RNA-dependent RNA polymerase [pT7^7 
(DCoH) ] , and for the expression of the RNA-dependent RNA 
polymerase in E. coli cells [pT7-7 (NS5B) ] , respectively. 

Figure 3 shows a schematic drawing of (+) and {-) -strands of 
D-RNA. The transcript contains the coding region of the DCoH 
mRNA. The DNA-oligonucleot ides a, b and c were designed to anneal 
with the newly- synthesized antisense RNA and the DNA/RNA hybrid 
was subjected to cleavage with RNase H. The lower part of the 
diagram depicts the expected RNA fragment sizes generated by 
RNase digestion of the RNA {-) hybrid with oligonucleotides a, b 
and c, respectively. 

DEPOSITS 



DHl E. coli bacteria, transformed using the plasmids pBac 5B, 
pBac 25, pT7-7 DCoH and pT7.7NS5B - containing SEQ ID NO: 1; SEQ 



ID NO: 2; the cDNA for transcription of SEQ ID NO: 12; and SEQ ID 
NO: 1, respectively, filed on May 9, 1995, with The National 
Collections of Industrial and Marine Bacteria Ltd. (NCIMB) , 
Aberdeen, Scotland, UK, under access numbers NCIMB 40727, 40728, 
40729 and 40730, respectively. . . 

EXAMPLE 1 

Method of Expression of HCV RdRp/TNTase in Spodoptera frugiperda 
Clone 9 Cultured Cells 

Systems for expression of foreign genes in insect cultured 
cells, such as Spodoptera frugiperda clone 9-(Sf9) cells infected 
with baculovirus vectors are known in the art (V.A. Luckow, 
Baculovirus systems for the expression of human gene products, 

(1993) Current Opinion in Biotechnology 4, pp. 564 - 572). 
Heterologous genes are usually placed under the control of the 
strong polyhedrin promoter of the Autographa californica nuclear 
polyhedrosis virus of the Bombyx mori nuclear polyhedrosis virus. 
Methods for the introduction of heterologous DNA in the desired 
site in the baculoviral vectors by homologous recombination are 
also known in the art (D.R. O'Reilly, L.K. Miller, V.A. Luckow, 

(1992), Baculovirus Expression Vectors - A Laboratory Manual, 
W.H. Freeman and Company, New York) . 

Plasmid vectors pBacSB and pBac25 are derivatives of 
pBlueBacIII (Invitrogen) and were constructed for transfer of 
genes coding for NS5B and other non- structural HCV proteins in 
baculovirus expression vectors. The plasmids are schematically 
illustrated in Figure 1 and their construction is described in 
detail in Example 8. Selected fragments of the cDNA corresponding 
to the genome of the HGV-BK isolate (HCV-BK; Takamizawa, A., 
Mori, C, Fuke, I., Manabe, S., Murakami, S., Fujita, J., Onishi, 



E., Andoh, T. , Yoshida, I., andOkayama, H., (1991), Structure 
and Organization of the Hepatitis C Virus Genome Isolated from 
Human Carriers, J. Virol., 65, 1105 - 1113) were cloned under the 
strong polyhedrin promoter of the nuclear polyhedrosis virus and 
flanked by sequences that allowed homologous recombination in a 
baculovirus vector. 

In order to construct pBacBD, a PGR product containing the 
cDNA region encoding amino acids 2420 to 3010 of the HCV 
polyprotein and corresponding to the NS5B protein (SEQ ID NO: 1) 
was cloned between the BariiRI and Hindlll sites of pBlue BacIII. 
The PGR sense oligonucleotide contained a translation initiation 
signal, whereas the original HGV termination codon serves for 
, translation termination. 

PBac25 is a derivative of pBlueBacIII (Invitrogen) where the 
cDNA region coding for amino acids 810 to 3010 of the HCV-BK 
polyprotein (SEQ ID NO: 2) was cloned between the Ncol and the 
Hindlll restriction sites. 

Spbdoptera frugiperda clone 9 (Sf 9) cells and baculovirus 
recombination kits were purchased from Invitrogen. Cells were 
grown on dishes or in suspension at 27°G in complete Grace's 
insect medium (Gibco) containing 10% fetal bovine serum (Gibco) • 
Transf ection, recombination, and selection of baculovirus 
constructs were performed as recommended by the manufacturer. Two 
recombinant baculovirus clones, Bac25 and Bac5B, were isolated 
that contained the desired HCV cDNA. 

For protein expression, Sf9 cells were infected either with 
the recombinant baculovirus Bac25 or Bac5B at a density of 2 x 
10^ cells per ml in a ratio of about 5 virus particles per cell. 
48 - 72 hours after infection, the Sf9 cells were harvested by 



centrifuging, washed once with phosphate buffered saline (PBS) 
and carefully resuspended (7.5 x lo'' cells per ml) in buffer A 
(10 mM Tris/Cl pH-8, 1.5 mM MgCl2/ 10 tnM NaCl) containing 1 mM 
dithiothreitol (DTT) , 1 mM phenylmethylsulf onyl fluoride (PMSF, 
Sigma) , and 4 mg/ml leupeptin. All the following steps were 
performed between 0° and 4°C: after swelling for 30" minutes, the 
cells were disrupted by 2 0 strokes in a Bounce homogenizer using 
a tight-fitting pestle. Glycerol, as well as the detergents 
Nonidet P-40 (NP40) and 3 -[ (3-Cholamidopropyl) -dimethyl -ammonio] - 
1-propanesulfonate (CHAPS) , were added to final concentrations of 
10% (v/v) , 1% (v/v) and 0.5% (w/v) , respectively, and the 
cellular extract was incubated for. a further hour on ice with 
occasional agitation. The nuclei were pelleted by centrif ugation 
for 10 minutes at 1000 x g, and the supernatant was collected. 
The pellet was resuspended in buffer A containing the above 
concentrations of glycerol and detergents (0.5 ml per 7.5 x 10 
nuclei) by 20 strokes in the Dounce homogenizer and then 
incubated for one hour on ice. After repelleting the nuclei, both 
supernatants were combined, centrifuged for 10 minutes at 8000 x 
g and the pellet was discarded. The resulting crude cytoplasmic 
extract was used either directly to determine the RdRp activity 
or further purified on a sucrose gradient (see Example 5) . 

Infection of Sf9 cells with either the recombinant 
baculovirus Bac25 or Bac5B leads to the expression of the 
expected HCV proteins. Indeed, following infection. of Sf9 cells 
with Bac25, correctly-processed HCV NS2 (24 kDa) , NS3 (68 kDa) , 
NS4B (26 kDa), NS4A (6 kDa), NS5A (56 kDa), and NS5B (65 kDa) 
proteins can be detected in the cell lysates by SDS 
polyacrylamide gel electrophoresis (SDS-PAGE) and immunostaining . 
Following infection of Sf9 cells with Bac5B, only one HCV-encoded 
protein, corresponding in size to authentic NS5B (65 kDa) , is 



detected by SDS-PAGE followed by immuno- or Coomassie Blue 
staining. 

EXAMPLE 2 

Method of Assay of Recombinant HCV RdRp on a Synthetic RNA 
Template/Substrate 

The RdRp assay is based on the detection of labeled 
nucleotides incorporated into novel RNA products. The in vitro 
assay to determine RdRp activity was performed in a total volume 
of 40 lil ' containing 1 - 5 >il of either Sf9 crude cytoplasmic 
extract or purified protein fraction. Unf ractionated or purified 
cytoplasmic Sf 9 cell extracts infected with Bac25 or Bac5D may be 
used as the source of HCV RdRp. A Sf9 cell extract obtained from 
cells infected with a recombinant baculovirus construct 
expressing a protein that is not related to HCV may be used as a 
negative control. The following supplements are added to the 
reaction mixture (final concentrations): 20 mM Tris/Cl pH 7.5, 
5 mM MgClz/ 1 mM DTT, 25 mM KCl, 1 mM EDTA, 5-10 pCi [^^P] NTP 
of one species (unless otherwise specified, GTP, 3000 Ci/mmol, 
Amersham, was used), 0.5 mM each NTP (i.e., OTP, UTP, ATP unless 
otherwise specified), 20 U RNasin (Promega) , 0.5 pg RNA-substrate 
(ca. 4 pmol; final concentration 100 nM) , 2 \xg actinomycin D 
(Sigma) . The reaction was incubated for two hours at room 
temperature, stopped by the addition of an equal volume of 2 x 
Proteinase K (PK, Boehringer Mannheim) buffer (300 mM NaCl, 100 
mM Tris/Cl pH 7.5, 1% w/v SDS) and followed by half an hour of 
treatment with 50 pg of PK at 37°C- RNA products were PCA 
extracted, precipitated with ethanol, and analyzed by 
electrophoresis on 5% polyacryl amide gels containing 7M urea. 



The RNA substrate we normally used for the assay (D-RNA) had 
the sequence reported in SEQ ID NO: 12, and was typically 
obtained by in vitro transcription of the plasmid pT7-7(DCoH) 
with T7 polymerase; as described below. 

Plasmid pTl-l (DCoH) (Figure 2) was linearized with the 
unique Sgllll restriction site contained at the end of the DCoH 
coding sequence and transcribed in vitro with T7 polymerase 

(Stratagene) using the procedure described by the manufacturer. 
Transcription was stopped by the addition of 5 U/10 pi of Dnasel 

(Promega) . The mixture was incubated for a further 15 minutes and 
extracted with phenol/chlorof orm/isoamyl alcohol (PCA) . 
Unincorporated nucleotides were removed by gel filtration through 
a 1 ml Sphadex G50 spun column. After extraction with PCA and 
ethanol precipitation, the RNA was dried, redissolved in water, 
and its concentration determined by optical density at 260 nm. 

As will be clear from. the experiments described below, any 
RNA molecule other than D-RNA may be used for the RdRp assay of 
the invention. 

The above -described HCV RdRp assay gave rise to a 
characteristic pattern of radioactively-labeled reaction 
products: one labeled product that co-migrated with the substrate 
RNA was observed in all reactions, including the negative 
control. This RNA species could also be visualized by silver 
staining and was thus thought to correspond to the input 
substrate RNA, labeled most likely by terminal nucleotidyl 
transferase activities present in cytoplasmic extracts of 
baculovirus- infected Sf9 cells. In the reactions carried out with 
the cytoplasmic extracts of Sf9 cells infected with either Bac25 
or Bac5B, but not of cells infected with a recombinant 
baculovirus construct expressing a protein that is not related to 



HCV, an additional bank was observed, migrating faster than the 
substrate RNA. This latter reaction product was found to be 
labeled to a high specific activity, since it could be detected 
solely by autoradiography and not by silver staining. This novel 
product was found to be derived from the externally- added RNA 
template, as it was absent from control reactions where no RNA 
was added. Interestingly, the formation of a labeled species 
migrating faster than the substrate RNA was consistently observed 
with a variety of template RNA molecules, whether containing the 
HCV 3' untranslated region or not. The messenger RNA of the 
liver- specif ic transcription cof actor DCoH (D-RNA) turned out to 
be an efficiently accepted substrate in our RdRp assay. 

In order to define the nature of the species generated in 
the reaction by the .Bac25- or Bac5B-inf ected cell extracts, we 
carried out the following series of experiments: (i) The product 
mixture was treated with RNase A or Nuclease PI. As this resulted 
in the complete disappearance of the radioactive bands, we 
concluded that both the labeled products were RNA molecules, 
(ii) Omission from the reaction mixtures of any of the four 
nucleotide triphosphates resulted in labeling of only the input 
RNA, suggesting that the faster-migrating species is a product of 
a polymerization reaction, (iii) Omission of Mg^"^ ions from the 
assay caused a complete block of • the reaction: neither synthesis 
of the novel RNA nor labeling of the RNA substrate was observed, 
(iv) When the assay was carried out with a radioactively labeled 
input RNA and unlabeled nucleotides, the labeled product was 
indistinguishable from that obtained under the standard 
conditions. We concluded from this result that the novel RNA 
product is generated from the original input RNA molecule. 

Taken together, our data demonstrate that the extracts of 
Bac25- or Bac5B-inf ected Sf9 cells contain a novel magnesium- 



dependent enzymatic activity that catalyzes de novo RNA 
synthesis. This activity was shown to be dependent upon the 
presence of added RNA, but independent of an added primer or of 
the origin of the input RNA molecule. Moreover, as the products 
generated by extracts of Sf9 cells infected with either Bac25 or 
Bac5B appeared to be identical, the experiments just described 
indicate that the observed RdRp activity is encoded by the . HGV 
NS5B protein. 

EXTU^PLE 3 

Methods for the Characterization of the HCV RdRp RNA Product 

The following methods were employed in order to elucidate 
the structural RNA product. Under our standard electrophoresis 
conditions (5% polyacrylamide, 7M urea) , the size of the novel 
RNA product appeared to be approximately 200 nucleotides. This 
could be due to either internal initiation of RNA transcription, 
or to premature termination. These possibilities, however, 
appeared to be very unlikely, since products derived from RdRp 
assays using different RNA substrates were all found to migrate 
significantly faster than their respective templates. Increasing 
the temperature during electrophoresis and the concentration of 
acrylamide in the analytic gel leads to a significantly different 
migration behavior of the RdRp product. Thus, ^ using, for 
instance, a gel system containing 10% acrylamide, 7M urea, where 
separation was carried out at higher temperatures, the RdRp 
product migrated slower than the input substrate RNA, at a 
position corresponding to at least double the length of the input 
RNA. A similar effect was observed when RNA- denaturing agents 
such as methylhydroxymercury (CHaHgOH, 10 mM) were added to the 
RdRp products prior to electrophoresis on a low-percentage/lower 



temperature gel. These observations suggest that the RdRp product 
possesses an extensive secondary structure. 

We investigated the susceptibility of the product molecule 
to a variety of ribonucleases of different specificity. The 
product was completely degraded upon treatment with RNase A. On 
the other hand, it was found to be surprisingly resistant to 
single-strand specific nuclease RNase Tl. The input RNA was 
completely degraded after 10 minutes incubation with 60 U RNase 
Tl at 22^C, and silver staining of the same gel confirmed that 
not only the template, but also all other RNA usually detectable 
in the cytoplasmic extracts of Sf9 cells, was completely 
hydrolyzed during incubation with RNase Tl . In contrast, the RdRp 
product remained unaltered and was affected only following 
prolonged incubation with RNase Tl . Thus, after 2 hours of 
treatment with Rnase Tl the labeled product molecule could no 
longer be detected at its original position in the gel. Instead, 
a new band appeared that had an electrophoretic mobility similar 
to the input template RNA. A similar effect was observed when 
carrying out the RNase Tl digestion for 1 hour, but at different 
temperatures: at 22°C, the RdRp product remained largely 
unaffected whereas at 37®C it was converted to the new product 
that co-migrated with the original substrate. 

The explanation for these observations is that the input RNA 
serves as a template for the HCV RdRp, where the 3' -OH is used to 
prime the synthesis of the complementary strand by a '"turn" or 
"'copy-back" mechanism to give rise to a duplex RNA ''hairpin" 
molecule, consisting of the sense (template) strand to which an 
antisense strand is covalently attached. Such a structure would 
explain the unusual electrophoretic mobility of the RdRp product 
on the polyacrylamide gels as well as its high resistance to 
single-strand specific nucleases. The hairpin RNA molecule should 



not be base -paired and therefore ought to be accessible to the 
nucleases. Treatment with RNase Tl thus leads to the hydrolysis 
of the covalent link between the sense and antisense strands to 
yield a double -stranded RNA molecule. During denaturing gel 
electrophoresis, the two strands become separated and only the 
newly- synthesized antisense strand, which should be similar in 
length to the original RNA template, would remain detectable. 
This - mechanism would appear rather likely, especially in view of 
the fact that this kind of product is generated by several other 
RNA polymerases in vitro. 

The following experiment was designed to demonstrate that 
the RNA product labeled during the polymerase reaction and 
apparently released by RNase Tl treatment exhibits antisense 
orientation with respect to the input template. 

For this purpose, we synthesized oligodeoxyribonucleotides 
■ corresponding to three separate sequences of the input template 
RNA molecule (Figure 2), oligonucleotide a, corresponding to 
nucleotides. 170 - 195 of D-RNA (SEQ ID NO: 3); oligonucleotide b, 
complementary to nucleotides 286 ~ 309 (SEQ ID NO: 4) ; 
oligonucleotide c, complementary to nucleotides 331 - 354 (SEQ ID 
NO: 5) . These were VLsed to generate DNA/RNA hybrids with the 
product of the polymerase reaction, such that they could be 
subjected to RNase H digests. Initially, the complete RdRp 
product was used in the hybridizations. However, as this 
structure is too thermostable, no specific hybrids were formed. 
The hairpin RNA was therefore pre- treated with RNase Tl,. 
denatured by boiling for 5 minutes and then allowed to cool down 
to room temperature in the presence of the respective 
oligonucleotide. As expected, exposure of the hybrids to RNase H 
yielded specific cleavage products. Oligonucleotide a-directed 
cleavage led to products of about 170 and 220 nucleotides in 



length, oligonucleotide b yielded products of about 2 90 and 110 
nucleotides and oligonucleotide c gave rise to fragments of about 
330 and 65 nucleotides. As these fragments are of the expected 
sizes {'see Figure 3), the results indicate that the HCV NS5B- 
mediated RNA synthesis proceeds- by a copy-back mechanism that 
generates a hairpin-like RNA duplex. 

EXAMPLE 4 

Method of Assay of Recombinant HCV.TNTase on a Synthetic RNA 
Substrate 

The TNTase assay is based on the detection of template- 
independent incorporation of labeled nucleotides to the 3' 
hydroxyl group of RNA substrates. The DNA substrate for the assay 
(D-RNA) was typically obtained by in vitro transcription of the 
linearized plasmid pT7-7DCoH with T7 polymerase as described in 
Example 2. However, any RNA molecule other than D-RNA may be used 
for the TNTase assay of the invention. 

The in vitro assay to determine TNTase activity was 
performed in a total volume of 40 pi containing 1 - 5 |il of 
either Sf9 crude cytoplasmic extract of purified protein 
fraction. Unf ractionated or purified cytoplasmic extracts of Sf9 
cells infected with Bac25 or Bac5B may be used as the source of 
HCV TNTase. An Sf9 cell extract obtained from cells infected with 
a recombinant baculovirus construct expressing a protein that is 
not related to HCV may be used as a negative control . The 
following supplements are added to the reaction mixture (final 
concentrations) : 20 mM Tris/Cl pH 7.5, 5 mM MgClj/ 1 mM DTT, 25 
mM KCl, 1 mM EDTA, 5-10 yCi [^^P] NTP of one species (unless 
otherwise specified, UTP, 3000 Ci/mmol, Amersham, was used), 20 U 
RNasin (Promega), 0.5 ]ig RNA-substrate (ca. 4 pmol) ;. final 



concentration 100 mM) , 2 yg actinomycin D (Sigma). The reaction 
was incubated for two hours at room temperature; stopped by the 
addition of an equal volume of 2 x Proteinase K (PK, Boehringer 
Mannheim) buffer (300 mM NaCl, 100 mM Tris/Cl pH 7.5, 1% w/v SDS) 
and followed by half an hour of treatment with 50 yg of PK at 
37°C. RNA products were PCA extracted, precipitated with ethanol 
and analyzed by electrophoresis on 5% polyacrylamide gels 
containing 7M urea. 

EXAMPLE 5 

Method for the Purification of the HCV RdRp/TNTase by Sucrose 
Gradient Sedimentat ion 

A linear 0.3 - 1.5 M sucrose gradient was prepared in 
buffer A containing detergents (see Example 1) . Up to. 2 ml of 
extract of Sf9 cells" infected with Bac5B or Bac25 (corresponding 
to about 8 X lO'' cells) were loaded onto a 12 ml gradient. 
Centrifugation was carried out for 20 hours at 39000 x g using a 
Beckmann SW40 rotor. 0 . 5 ml fractions were collected and assayed 
for activity. The NS5B protein, identified by immunostaining, was 
found to migrate in the density gradients with an unexpectedly 
high sedimentation coefficient. The viral protein and ribosomes 
were found to co-sediment in the same gradient fractions. This 
behavior enabled us to separate the viral protein from the main 
bulk of cytoplasmic proteins, which remained on the top of the 
gradient. The RdRp activity assay revealed that the RdRp activity 
co-sedimented with the NS5B protein. A terminal nucleotidyl 
transferase activity was also present in these fractions. 

EXAMPLE 6 



Method for the Purification of the HCV TNTase/RdRp from Sf9 Cells 



Whole cell extracts are made from 1 g of Sf9 cells infected 
with BacSB recombinant baculovirus. The frozen cells are thawed 
on ice in 10 ml of buffer containing 20 mM Tris/HCl pH 7.5, 1 mM 
EDTA, 10 mM DTT, 50% glycerol (N buffer) supplemented with 1 mM 
PMSF. Triton X-100 and NaCl are then added to a final 
concentration of 2% and 500 mM, respectively, in order to promote 
cell breakage. After the addition of MgCla (10 mM) and DNa'se I 
(15 pg/ml) , the mixture is stirred at room temperature for 30 
minutes. The extract is then cleared by ultracentrif ugation in a 
Beckman centrifuge, using a 90 Ti rotor at 40000 rpm for 30 
minutes at 4°C. The extract is diluted with a buffer containing 
20 mM Tris/HCl pH 7.5, 1 mM EDTA, 10 mM DTT, 20% glycerol, 0.5% 
Triton X-100 (LG buffer) in order to adjust the NaCl 
concentration to 3 00 mM, and incubated with 5 ml of DEAE- 
Sepharose Fast Flow, equilibrated in LG buffer containing 300 mM 
NaCl . The matrix is then poured into a column and washed with two 
volumes of the same buffer. -The ''flow- through" and the first wash 
of the DEAE-Sepharose Fast Flow column is diluted 1:3 with LG 
buffer and applied onto a heparin- Sepharose CL6B column (10 ml) 
equilibrated with LG buffer containing 100 mM NaCl. The Heparin- 
Sepharose CL6B is washed thoroughly and the bound proteins are 
eluted with a linear 100 ml gradient, from 100 mM to IM NaCl in 
LG buffer. The fractions containing NS5B, as judged by silver- 
and immunostaining of SDS-PAGE, are pooled and diluted with LG 
buffer in order to adjust the NaCl concentration to 50 mM. The 
diluted fractions are subsequently applied to a Mono Q-FPLC 
column (1 ml) equilibrated with LG buffer containing 50 mM NaCl. 
Proteins are eluted with a linear gradient (20. ml) from 50 mM to 
1 M NaCl in.LG buffer. The fractions containing NS5B, as judged 
by silver- and immunostaining of SDS-PAGE, are pooled and diluted 
with LG buffer to adjust the NaCl concentration to 50 mM. The 
fractions containing NS5B, as judged by silver- and' 



immunostaining by SDS-PAGE, are pooled and dialyzed against LG 
buffer containing 100 mM NaCl . After extensive dialysis, the 
pooled fractions were loaded onto poly (U) -Sepharose CL6B (10 ml) 
equilibrated with LG buffer containing 100 mM NaCl . The poly(U)- 
Sepharose CL6B was washed thoroughly and the bound proteins were 
eluted with a linear 100 ml gradient, from 100 mM to IM NaCl in 
LG buffer. The fractions containing NS5B, as judged by silver- 
and immunostaining of SDS-PAGE, are pooled, dialyzed against LG 
buffer containing 100 mM NaCl and stored in liquid nitrogen prior 
to activity assay. 

Fractions containing the purified protein. NS5B were tested 
for the presence of both activities. The RdRp and TNTase 
activities were found in the same fractions. These results 
indicate that both activities, RNA~dependent RNA polymerase and 
terminal ' ribonucleotide transferase, are the functions of the HCV 
NS5B protein. 

We tested the purified NS5B for terminal nucleotidyl 
transferase activity with each of the four ribonucleotide 
triphosphates at non-saturating .substrate concentrations. The 
results clearly showed that UTP is the preferred TNTase 
substrate, followed by ATP, CTP and GTP, regardless of the origin 
of the input RNA. 

EXAMPLE 7 

Method of Expression of HCV RdRp/TNTase in E. coli 

The plasmid pT7-7(NS5B), described in Figure 2 and Example 
8, was constructed in order to allow expression in E. coli of the 
HCV protein fragment having the sequence reported in SEQ ID NO 1. 
Such protein fragment contains the RdRp and the TNTase of NS5B, 



as discussed above. The fragment of HCV cDNA coding for the NS5B 
protein was thus cloned downstream of the bacteriophage T7 010 
promoter and in frame with the first ATG codon of the phage T7 
gene 10 protein, using methods that are known in molecular 
biology practice and described in detail in Example 8. The pT7- 
7{NS5B) plasmid also contains the gene for the b-lactamase enzyme 
that can be used as a marker of selection of E. coli cells 
transformed with plasmid pT7-7(NS5B) . 

The plasmid pT7-7(NS5B) was then transformed in the E. coli 
strain BL21(DE53), which is normally employed for high-level 
expression of genes cloned into expression vectors containing T7 
promoter. In this strain of E. coli, the T7 gene polymerase is 
carried on the bacteriophage X DE53, which is integrated into the 
chromosome of BL21 cells (Studier and Moffatt, Use of 
bacteriophage T7 RNA polymerase to direct selective high-level 
expression of cloned genes, (1986), J. Mol . Biol, 189, pp. 113 - 
130) . Expression from the gene of interest is induced by addition 
of isopropylthiogalactoside (IPTG) to the growth medium according 
to a procedure that has been previously described (Studier and 
Moffatt, 1986) . The recombinant NS5B protein fragment containing 
the RdRp is thus produced in the inclusion bodies of the host 
cells. Recombinant NS5B protein can be purified from the 
particulate fraction of E. coli B121(DE53) extracts and refolded 
according to procedures that are known in the ,art (D.R. Thatcher 
and A. Hitchcock, Protein folding in Biotechnology, (1994), in 
^^Mechanism of protein folding", R.H. Pain, EDITOR, IRL PRESS, pp. 
229 - 255) . 

EXAMPLE 8 - 

Detailed Construction of the Recombinant Vectors Expressing NS5B 



Selected fragments of the cDNA corresponding to the genome 
of the HCV-BK isolate (HCVBK) were cloned under the strong 
polyhedrin promoter of the nuclear polyhedrosis virus and flanked 
by sequences that allowed homologous recombination in a 
baculovirus vector. 

pBac5B contains the HCV-BK sequence comprised between 
nucleotide 7590 and 9366, and codes for the NS5B protein reported 
in SEQ ID NO: 1. In order to obtain this plasmid, a cDNA fragment 
was generated by PGR using synthetic oligonucleotides having the 
sequences 5 ' -AAGGATCCATGTCAATGTCCTACACATGGAC-3 ' (SEQ ID NO: 6) 
and 5' -AATATTCGAATTCATCGGTTGGGGAGCAGGTAGATG-3' (SEQ ID NO: 7), 
respectively. The PGR product was then treated with the Klenow 
DNA polymerase, digested at the 5' end with SainHI, and 
subsequently cloned between the BamHI and Smal sites of the 
Bluescript SK{+) vector. Subsequently, the cDNA fragment of 
interest was digested out with the restriction enzymes BamHI and 
Hindlll and religated in the same sites of the pBlueBacIII vector 
(Invitrogen) . . 

pBac2 5 contains the HCV-BK cDNA region comprised between 
nucleotides 2759 and 9416 and codes for amino acids 810 to 3010 
of the HCV-BK polyprotein (SEQ ID NO: 2). This construct was 
obtained as follows: first, the 820 bp cDNA fragment containing 
the HCV-BK sequence comprised between nucleotides 2759 and 3578 
was obtained from pCD(38-9.4) (Tomei, L., Failla, C., Santolini, 
E., De Francesco, R., and La Monica, N. (1993) NS3 is a Serine. 
Protease Required for Processing of Hepatitis C Virus 
Polyprotein, J. Virol., 67, 4017 - 4026) by digestion with Ncol 
and cloned in the Ncol site of the pBlueBacIII vector 
(Invitrogen) yielding a plasmid called pBacNCO. The cDNA fragment 
containing the HCV-BK sequence comprised between nucleotides . 1959 
and 9416 was obtained from pCD (38-9.4) (Tomei et ai, 1993) by 



digestion with NotI and Xbal and cloned in the same sites of the 
Bluescript SK(+) vector yielding a plasmid called pBlsNX. The 
cDNA fragment containing the HCV-BK sequence comprised between 
nucleotides 3304 and 9416 was obtained from pBlsNX by digestion 
with SacII and Hindlll and cloned in the same sites of the pBlsNX 
plasmid, yielding the pBac25 plasmid. 

PT7-7(DCoH) contains the entire coding region (316 
nucleotides) of the rat dimerization cof actor of hepatocyte 
nuclear factor-la {DCoH; Mendel, D.B., Khavari, P. A., Conley, 
P.B., Graves, M.K.,. Hansen, L.P., Admon, A., and Crabtree, G.R., 
(1991), Characterization of a. Cof actor that Regulates 
Dimerization of a Mammalian Homeodomain Protein, Science, 254, 
1762 - 1767; GenBank accession number: M83740) . The cDNA fragment 
corresponding to the coding sequence for rat DCoH was amplified 
by PGR using the synthetic oligonucleotide Dprl and Dpr2 that 
have the sequence TGGCTGGCAAGGCACACAGGCT (SEQ ID NO. : 8) and 
AGGCAGGGTAGATCTATGTC (SEQ ID .NO: 9), respectively. The cDNA 
fragment thus obtained was cloned into the Smal restriction site 
of the E. coli expression vector pT7-7. The pT7-7 expression 
vector- is a derivative of pBR322 .that contains, in addition to 
the S-lactamase gene and the Col El origin of replication, the T7 
polymerase promoter 010 and the translational start site for the 
T7 gene 10 protein (Tabor, S., and Richerdoson, C.C. (1985) A 
bacteriophage T/ RNA polymerase/promoter system for controlled 
exclusive expression of specific genes, Proc. Natl. Acad. Sci. 
USA, 82, 1074 - 1078) . 

pT7-7(NS5B) contains the HCV sequence from nucleotide 7590 
to nucleotide 9366, and codes for the NS5B protein reported in 
SEQ ID NO: 1. In order to obtain this plasmid, a cDNA fragment 
was generated by PGR using oligonucleotides having the sequences 
5-TCAATGTCCTACACATGGGAC-3' (SEQ ID NO: 10) and 5'- 



GATCTCTAGATCATCGGTTGGGGGAGGAGGTAGATGCC-3' (SEQ ID NO: 11), 
respectively. The PGR product was then treated with the Klenow 
DNA polymerase, and subsequently ligated in the E. coli 
expression vector pT7-7 after linearizing it with EcoRI and 
blunting its extremities with the Klenow DNA polymerase. 



SEQUENCE LISTING 

GENERAL INFORMATION 

(i) APPLICANT: ISTITUTO DI RICERCHE DI BIOLOGIA MOLECOLARE P. 
ANGELETTI S.p.A. 

(ii) TITLE OF INVENTION: METHOD FOR REPRODUCING IN VITRO THE . 
RNA-DEPENDENT RNA POLYMERASE AND TERMINAL NUCLEOTIDYL 
TRANSFERASE ACTIVITIES ENCODED BY HEPATITIS C VIRUS (HCV). 

(iii) NUMBER OF SEQUENCES: 12 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Societa Italiana Brevetti 

(B) STREET: Piazza di Pietra, 39 

(C) CITY: Rome 

(D) COUNTRY: Italy 

(E) POSTAL CODE: 1-00186 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy Disk 3.5" 1.44 MB 

(B) COMPUTER: IBM PC Compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS Rev. 5.0 

(D) SOFTWARE: Microsoft Word 6.0 

(viii) ATTORNEY INFORMATION. 

(A) NAME: DI CERBO, Mario (Dr.) 
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(ix) TELECOMMUNICATION INFORMATION 

(A) TELEPHONE: 06/6785941 

(B) TELECOPIER: 06/6794692 

(C) TELEX: 612287 ROPAT 



(1) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS . 

(A) LENGTH: 591 amino acids 

(B) TYPE: amino acid 



(C) NUMBER OF CHAINS: single 

(D) CONFIGURATION: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: .No 

(iv) ANTI SENSE: No 

(v) FRAGMENT TYPE: C-terminal fragment 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 

(C) ISOLATE: BK . 

(vii) IMMEDIATE SOURCE: cDNA clone pCD(38-9.4) described 
by Tomei et al , 1993 

(ix) CHARACTERISTICS : 

(A) NAME: NS5B Non-structural protein 

(C) IDENTIFICATION METHOD: Experimental. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 



[SEQUENCES] 



l.R.B.Jl.UiK. a>.it. .Ni_if_i'-A 



10 00 19:00 F.-a -39 0(581093854 i ■ R ■ B ■ M 



-37- 

«ra(38-9 4) d«3crit-o da Tcnusi ed altri. 1993 
FONTS IWZDIATA: CDNA cloue pOOS 9.4) 

(-•x) CXRATTSRI STICKS: 

NDKB: Prct^ina noa-struttuxal« IJSSa 
(C, KETODODI IDEMTIrlCAZlONE: .Spertmentale 
DES«lZIO«E DSUA SEQCEKZA: SEQ ID .NO: 1. 
•Se. «e. sex Ty. T^p T^ O.y .eu T^^ ..o C., ^a 
C-.U S« .y3 L X.. -a I S« .sn S.. .eu .eu ^ 



20 



25 ! 



M.-- ry. M. T« -mr s.r s.r .ly 

40 I * = 

.1, ^.y. Lyl val Thr A.? Leu Glr. Val Leu Asp Asp Hi, Ty. 

"so " " 

A3P vax Leu Ly, cxu «ec Ly, Ala Lys :Ala s.. T.r Val Lys Ala 

70 : ^- 

Z ^eu Leu Ser Val Clu Glu Ala Cys Lys ,leu Thr Pro Pre Hi« Ser 
93 " 
Lva Ser Lya Phe Oly Tyr Gly Ala Ly3 :Asp Val A--. A« Leu S« 

■ . ^« ■ i 

-.ys Ala val A.U Eis He Ei= Ser Val;Trp Ly, Wp Leu Leu Glu 

120 ; "V 

. ,iB iven Ttr -^hr He! Met Ala Lys Asr. Glu Val 

AS3 Tar Val Tir Ere He nsp x^- .at ^ 

135 i 

cys val Gla Pro' Glu Lys Gly Gly atjIlv. ?r= Ala Arg Leu Zle 



"5 . ■ 

, Vai --e pro A.. Leu Gly Val A.g Val Cys i Glu Ly, Hat A.a Lau Tyr 
' ifis 170j 

AS, val val Sar Thx L.u .ro Gin Val Valj Met Gly Ser Ser Tyr Gly 

185 



lao i 
Gin Tyr Sar Pro Gly Gin Arg Val GlJ Phe L«u Val Asn Thr Trp 

Ly. ser Lys Ly. A.u Pro Met Gly 7^, Ser Tyr Asp Tiir Arg Cya Phe 

A., sar Thr Val Thr Glu Aan Aap I'. . Arg. Val Glu Glu Ser He Tyr 

Gl. Cya Cya Aap Leu Ala Pro Glu Ala Arg Gin Ala He Ly, Ser Lau 

245 250 255 

Ti.- Glu Axg Leu Tyr He Gly Gly Pro Lai Aan Ser Ly- Gly Gin 



2€0 



2SS 



270 



10 00 19:00 FAI ^39 0691093834 I . R.B . M ^R^CIENTIFICA 



-38- 
I 



Ala Ser Gly Val Leu Thr Thr Ser 



; 38S 



Asn cys Gly Tyr A=g Arg cys Axg 

280 

n-H. CVS Tyr Leu Lys sar Ala Ala Cy» Arg 
e/s Gly ASH Thr Leu Thr Cys Tyr teu y 

„ ■ 295 500 

rin A« cy« Thr M^c Leu Val Aan Gly A3p A^P 
Ma Ala Lys ^^^^ 320 

315 

310 

305 ^"-^ 



v.. XI. C= T.. Oln 01. "P 

310 

32S 

^1* M«t Thr Arg Tyr Ser Ala Pro Pro Gly Asp 
Arc Val Thr Glu Ala Met Tnr i^g 



345 ^50 
pro Ola Trl Glu Tyr As, Leu Glu Leu He Thr Ser Cys Sar Ser 

360 3SS 
val sir V.1 Ala His As, Ala Ser C-ly Ly, A., val x-yr L.u 
370 

Thx Tnr -ro Leu Ala Arg Ala Ala Trp Glu Thr Ala 
Thr Arg Aso Pro Thx -ro = 

Thr pro val Zl Ser Trp Leu Gly 'Aan He tie Me. Tyr Ala 



Arc KLfl 



405 



410 



415 



... Thx Leu Trp Ala Arg Me. He '-u H.t Thr Kia Ph. Phe Ser He 

425 ' 

Leu L.u Ala Til Glu 01. Leu Olu Ly. Ala Leu «p cy. Gin He T,^ 
43S 

Oly Ala Cy, Tyx Ser He Glu Pro Leu A.p Leu Pro Gin He He Glu 
;u-g Leu Hi. Gly Leu Ser Ala Phe Ser Leu His Ser Tyr Ser Pro Gly 



480 

47Q 



475 

465 



.,3Iu lie A.n Arg Val Ala Ser Cy. Leu Arg; Lys Leu Gly Val Pro Pro 
48S . 



Lau Arg 



val Tr? Arg His Axg Ala Arg S.r Val Arg Ala Arg Leu Leu 



505 



510 



ser om Gly Gly Axg Ala Ala Thr Cys Gly Lye Tyr Leu Phe Asn Txp 
515 

Ala val Lya Thr Lye Leu Ly« Leu Thr Pro He Pro Ala Ala Ser Arg 

335 5*° 
Z Leu ser Gly Trp Phe Val Ala Gly Tyr Ser Gly Gly A^ He 
• ■ 5SS SSO 

't^ His .Ser Leu Ser Arg Ala Axg Pro Arg Trp Phe Met Leu Cys Leu 
SfiS 

Lftu Leu Lau Ser Val Gly 



5^5 

Val Gly He Ty^ ^ ^ 

530 



530 585 



INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 2201 amino acids 

(B) TYPE: amino acid 

(C) NUMBER OF CHAINS: single . 

(D) CONFIGURATION: linear 

(ii) MOLECULE. TYPE: polypeptide 

(iii) HYPOTHETICAL: No 

(iv) ANTI SENSE: No 

(v) FRAGMENT TYPE: C- terminal fragment 

(vii) IMMEDIATE SOURCE: cDNA clone pCD(38-9.4) described 

by Tomei et al, 1993 
(ix) CHARACTERISTICS: 

(A) NAME: NS2-NS5B Non- Structural protein 

precursor 

(C) IDENTIFICATION METHOD: Experimental 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 



[SEQUENCES] 



10 



I.R.B.H.DIR^SCIE.SliflCA 



@004 



•39- 



(2) IWrORMAZIONl SUUA SEQ ID NO- 2: ' 
til CWlArrHRXSTlCHE DZUA SSQOSHZA: 

(A) LUNGEEZZA: 2201 aRminoacidl 

(BV TIPO: ammino&cididica 

(C) NOMERO Dl CATENE: singoLa 

(D) CQNTIGOHAZIONE . linear ft 
(ii) TIPO DI MOLSCOIA: polipeptide 
(iit) IPOTSTXCAi Wo 

(iv) ANTiSENSO: No 

(v) TIPO DI FRAMI^ENTO: frar=n=ntc C-Cer:ninile . 1033 
(Vii) FOJTTS n^EDIATA: cDSA cloae 00(3 6 9.4) 

(A) NOME: NS2-i;sEB Ncnstr::c=ural Proteia Precuxscr 
(c) ME-roDO DI iDEHtlTICAZIONZ-. Sperimentale 
(x^) DELIA SEQCSNIA: SSQ ID NO; 3: 

r-„ Ala Al^ Ser Cvs Gly Gly Ala Val Phe.Val Gly 
Mac Asp J^S ''•^'^ ju.a -e. . 

_ 10 ^= 

1 . = 

.p.-- T.va Val Fha Leu Ala Arg 

leu val Leu Leu Thr Leu Ser Pro T^-r T^T Lys va. -n 

2S 30 ■ . ■ 

Leu lie Trp Z I^eu Oln Tyr P.e T.r Ar, Ala Glu Ala Aep Leu 
35 

H^3 val Trp lie Pro Pro Leu A^r. Ala Arg Gly Gly Arg Asp Ala He 

55 ^° 
lie -1 Lcu cy3 -Ua Val Eia Pro Glu L.u He Jhe Asp He rt^ 

7c 80 

70 

_ J/. Leu Leu lie Ala He Leu Gly Pro Leu Met Val L.u Gin Ala Gly . 

85 90 
lie Tlir Arg Val Pro Tyr Phe Val .rg Ala Gin Gly Leu He Hi. Ala 
XOO 

Cy, Me. Leu Val Arg Lye Val Ala Gly Gly His Tyr Val Gin M« Ala 
115 

Pl.e Met Ly. Leu Gly Ala t.u T!.r Gly Thr Tyr He Tyr Asn His Leu 

13S • 
IZ Leu Arg A-p Trp Pro A=3 Ala Gly Leu Arg.A^p Leu Ala Val 
ISO 1" 
ll! val Glu tro val Val Phe Ser A3p ««t Glu Lya He He Thr 

l« ^ 
Trp Cly Ala Asp Thr Ala Ala Cy, Gly A.p He He Leu Gly Leu Pro 



16 0 



20 
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s„ ^ XX. - - 

c" ou ..u r: ... X.. ^ 
z - - - ^ r: 

Z ^ ^v. v" ox. 0- - "-^ - ^ 

250 

Cln V« I" V.1 VX <^-' '2 ^ 

^ =1, r. «y ^ ''^ "° 

.fe.- ai» T-y, Th.- ;..n v»i «p Gin MP 

- pxo "1 «y A^- ^' Z 

30S 210 

TV, «u Hi, ^, V.1 r.. .r, «s »5 

330 ^ 

s« »s =" s« ... |» «- 
s.= s" «y Gly L.U L.. cy. Jro »h. =ly Si. «. v.l 



3« 



, ^ ^, . rjiv- V3.' Ala I<y3 Ala Vai 

Ilo sue' Arg Ala Ala Val Cys Zt^ A-S C-.y va_ a-* y 

375 

;^ val pro Val C-lu S«r M,^- Glu Thr M.t Arg S« Pro V.l 

39S 

3aS 330 
. ■ A.P A3a Ser Ser .ro Pro Ala Val Pro C-ln Ser P.. Gin Val " 

^.a Hi3 L.U Si, Ala Pro Cly Sax Gly Ly. S«r Thr Lys Val Pro 

Ala Ala Xyr Ala Ala Gin aly Tyr Ly, Val L.u Val Leu A-n Pro Ser 

val Ala AXa Thr I..u Gly Pha Gly Ala Tyr Me. Sar Ly3 Ala Gly 

450 «S 
XI, pro A3n Ha Arg Thr Gly Val Arg Thr He Thr Thr Gly Ala 

465 4-'° 
1 ,.o val Thr xyr Ser Thx Tyr Gly Ly, Phe Leu Ala Aap Gly Gly Cys 

j " ■ 483 ' 

. sa^ Gly Gly Ala Tyr A«P He He Ho Cya A.p Glu Cy- Ei. sar Thr 

. SOS 



•J 



I . R . B .M.UljL-^t- it> n H<.A 



-41- 



s« T.r T^- n. L.U alv U. .ly val «a 
^ o^y ^» v.. - - 

540 

v" T.. v,l pro »i= ^ XI. V.1 .1. s« «n 

555 • 

Z C-lv =lu - ^ -^^^ "° "° "° 

56S "° ^ 

, . ^ wi, Leu lie Phe Cy3 Hia Ser Lys Lys Lys Cys A'ap 
GlY Gly ^3 

80 

.lu Leu Ta Lys t.u sar Gly Leu Gly II. Asa Ma Val Ala Xyr 

555 "° 

-.,1 e^T- Val He P-o Tbr lie Gly As? val Val 
"•yr Arg Gly Leu Asp Val Ser Val -i.e 

- V>1 V,l T!-^ ASP Al.- --.U MeC =ly T^-' ^^y 

635 

val lie A3P C/3 III Ti^- Cy= val Thr Oln Thr Val. Asp Phe S.r 

L«u AS. pro Ti^ Pt, Thr Il« Glu Ti.r Thr Thr Val Pro Gla Asp .Ala 

66S 

val ser Arg ^l Glz ;^-g Arg Gly .^g Thr- Gly Arg Gly Arg Arg Gly 

S30 

ne Tyr Trg 5he Val Thr Pro Gly Glu Arg. Pro Ser Gly Mec Phe Aan 

69S ' 

ser Tel Val Leu Cys Glu Cys Tyr Asp Ala Gly C,3 Ala Trp Tyr Glu 

. 7T e 720 

705 '^^Q • 

, ^1 e^^ val Ara Leu Arg Ala Tyr Leu Asn Thr 

Leu Thr Pro Ala Glu Thr Ser vax a.5 ^ 

725 "^^^ 
P^o Qly L=u pro val cys Gin Aso Hia Le^ Glu Trp Glu Ser Val 

Phe Thr Gly Lcu Thr Hi5 He Asp Ala His Phe Leu Ser Gin Thr Lys 

; 

Gl» Ala Gly ASP Asn Phe pro Tyr Leu Val Ala Gla Ala Thr Val 



775 780 

770 



cys Al. Arg Ala Ala Pro Pro Pro Ser Trp Asp Cla Mec Trp Lya 
c!-! Leu lie Arg Leu lys Pre Thr Leu Kii Gly Pro Thr Pro Leu^Leu 



aio 815 

805 



Tyr Arg Leu Gly Ala Val Gin Asa Glu Vai Thr Leu Thr His Pro lie 

825 •■ 830 



fton 
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Ser Ala iiap Leu Giu Val val Thr 



Thr Lys Tyr Ue Met Ala Cys Met 

S« ThX IZ val r-eu Val Gly Cly Val Leu Ala Ala Leu Ala Ala tyr 

^ - ass 

cys l11 Thr Thr .ly S=r Val Val He Val Gly Arg Il« He Leu Ser 

370 

Tly Axg pro Ala 11- Val Pre Asp Arg Glu U Leu T-^ Gin Glu 5he 

eas i ®" 

;,,p Glu Met Glu Glu Cy= Ala Ser' Hie Leu Pro Tyr II. Glu Gin Gly 

„ec Gl. Leu Ala Glu Gin .he Ly, Gin Lys :Ala Leu Gly Leu Leu Gla 

! 

T^ Ala Thr Lys Gl. Ala Glu Al. Ala xla ^Pro Val v^ Glu Ser Lys 

935 ! 540 

-'^^ Zl Leu GIU THr 7he Trp Ala Lys Hi. Mec Trp Asn Pha He 

S45 ! 

a Gly Leu Ser;Thr Leu Pra Gly Aan Pro 



970 i 



97S 



Ser Gly He Gin Tyr Leu Ala Gly 

He Ala Ser ;aa Phe Thr Alajser He Thr Ser Pro Leu 

Thr Thr Gla Ser Thr Leu Leu Phe A« He^ Leu Gly Gly Trp Val Ala 

lOCQ ' lOOS 

Ala Glr. l11 Ala Pro Pre S.r Ala Ala Ser, Ala Phe Val Gly Ala Gly 

•010 "=-5 i 

lie Ala Gly Ala Ala Val Gly Ser He Gly; Leu Gly Lya Val Leu Val 

•,030 'l025 10*0 

1025 -05° i 

A3« He Leu Ala Gly Tyr Gly Ala Gly VaZi Ala Gly Ala Leu Val Ala 



1045 105«; 



Jb, Lys val Met Ser Gly Glu Met Pro 3«-_ Thr Glu A^p L.u.Val Asn 



Lau Leu Pro 

107 5 



1060 -'-'^^ j 

Ala He Leu Ser Pro Gly Ala Leu Val Val Gly Val Val 



loao 



1085 



cy3 Ala Ala He L«u Arg Arg.Hi. Val Gly Pro Gly Glu Gly Ala Val 



1090 



1095 i HOO 



Gin Trp Met A-n Arg Leu Ha. Ala Phe Ala Ser Arg Gly Asa His Val 

ser P-o Thr Si3 Tyr Val Pro Glu Ser Asp Ala Ala Ala Arg val Thr 
112S . 
Gin He Lau Ser Ser Leu Thr Ha Thr Gin Leu Leu Lys Arg Leu His 



1140 11*5 



' 1150 
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rr^^.^ n^ry c* '2L ^'^T Glv Ser Tto Lcu 
Gin Trp Xla Asn Glu Asp Cy- ser Pro C,. .- 7 

1155 ' 
^ val cv» T.- Va. ^ - ^^V^ 

^ aln ser I.y. Leu Pro Gin Leu Wo Gly V,^ Pro Ph» Ph. 
Trp Leu Gin Ser Lya i. i ^^^^ 

1190 3.i3 = 

=^ ^ ''^ ^ 

1210 ■ . 
1205 • 

c cv. =ir 

Oly s.r «.t II. val oly 7r= Ly. Thr^cys s.r Thx T=p H.. 

^, Th- Tb- Civ Pro Cys Thr Pro Ser 

Gly Thr Phe Pro Ila Asa Ala tyr Th. Tb- Qxy / 

1225 ■ 1260 

~ " , -1, ran T-a Arsi Val Ala Ala Glu Glu 

Pro Ala pro Asa Tyx^Ser Arg .eu .rp^^^ 

l!!'val Glu val T^^Jg val Gly ABp Phe,,5.ia Tyr Val Thr Gly Mat 
Thr Th= A.n val Lys Cy. Pro cys Gin: V.1 Pro Ala Pro Glu Phe 

Phe ser Glu Val A.p Gly val Axe La« Ei.i Arg T-yr Ala Pro Ala C-/» 

1320 ' ^^^^ 

Arg ProTe'u Lau Arg Glu Glu Val Thr PhJ Gin Val Gly Leu Asn Gin 

1335 

^J'Z val Gly ser Gin Leu Pro Cy» Ol^i^ Pro Glu Pro Asp Val Ala 

^-345 ■ i , ^ 

val Leu Thr Sar Mec Leu Thr Asp Pro Ser His Thr Ala Glu Thr . 

Ala L^, ATS Axa L.u Ala Arg Gly S.r Pr,^ Pro Ser Leu Ala ser Ser 

1385 ! 1390 
ser Ala Sar^II L«u Ser Ala Pro Ser Le^ Lys Ala Thr Cy3 Thr Thr 

I 

His val ser Pro Asp Ala Asp Leu He Glu Ala A.n Leu Leu Trp 
'"i.io j . , ^ 

Oln Glu Met Gly Gly Asn lie Thr Arg Val Glu Ser Glu A.n Ly, 

I 

val val val Leu Asp S.r Phe Asp Pro L.f Ala Glu Glu Asp Glu 

^ Glu val ser Val Pro Ala Glu He Leu Arg Ly» Ser Lye Lys Phe 
^ ' 1470 



1460 

1 



20.-10 -00 19:05 FAX -39 0881093654 



I.R.B.H.DIR.SCIENTIFICA 



©009/021 



-44' 



14-75 



1480 



1485 



^ ASP pro ABp Tyr Val Vro Pre Val Val His Gly 
Leu Glu ser Tr?.X-ys Asp i 



1500 



^490 ^^^^ 

^^ T^e P-0 Pro He pro ?rd. Pro Arg Arg 
Cys pro Lau Pro Pro He Lys Ala P.o Pro ^ ^^^^ 

1505 



1515 

^ vol V.I ..u T.. cxu ser S« ,al Ser S« ;aa Leu 



1525 ."30 
Glu Leu Ala Thr Lya Thr Phe Gly S.r Ser 
1540 

S=r Gly Thr Ala Thr Ala Lau Pro Aap Gla 
1S55 



1S3S 

Glu Sar ser Ala Val Asp 
1=S0 

Ala Ser Asp Asp Gly Asp 
1SS5 



. v>i elu se- Tyr ser ser Mec Pro Pre Leu Glu Gly 
Lya Gly Sar Asp ^'al Giu 56- 

1S7S ^^8° 
'a-uTr! Gly A=p pro A.p Leu Ser Asp Gly | Ser Trp Ser Thr Val Ser 



Glu Glu Ala ser Glu Asp Val Val Cya Cys 

IS OS ■ 



Ser Met Ser Tyr Thr Trp 
1615 



Thr Gly Ala Leu He Thr Pro Cys Ala Ala Glu Glu Ser Lye Leu Pre 

' 1630 



1620 



162S 



.^e «a Ala Leu Ser A^n Ser Leu Leu Argj Kle Eis A3n Met Val Tyr 

Ala Thr'Th^ ser Arg Ser Ala Gly Leu Ar^, Gin lys Lye Val Thr Phe 

1655 ! I^^O 

.3.'"! Leu Gin val Leu Asp Asp Hie Tyr Ara A.p Val Leu Lye Glu 
1S70 jlo/3 
. rc'Lv5 Ala Ly. Ala Ser thr Val Lys AlJ Lys Leu Leu Ser Val Glu • 
1685 l«9<i, 
Glu Ala eye Lys Leu Thr Pro Pro Eia Ser Ala Lye Ser Lye Phe Qly 

1710 



1700 



170S 



Tvr Gly. Al*. Ly3 Asp Val Arg Asn Leu Ser Sar Ly. Ala Val Aau His 



1715 



1720 



172S 



He Hi. Ser Val Trp Lyfl Asp Leu Leu Glu A.p Thr Val Thr Pro He 



173 0 



173S 



1740 



AS, Thr Thr 11. Met Ala Lys Asn Glu Val Phe Cys Val Glu Pro Glu 

1760 



1745 



17S0 



1755 



Lys Gly Gly Arg Ly, Pro Ala Arg leu lie Val Phe Pro Asp Leu Gly 

1775 . 



17S5 



1770 



r,-. Tftti Tvr Asp Val Val Ser Thr Leu 

val Arg Val Cya Glu Lys Mat Ala -eu lyr 

1790 



1780 



1785 



-45- 



cln =iy s« - «V P.. ^ s„ «o 01, 



1795 



laoo 



1805 



^ =1. -1 «r T=P LV. S« Ly, «n «o 

1815 

». ^ «p - - - r ^^z 

^r'' 

^ T^,- ri-i CV9 O/a I^s^ 

Asp tie Arg Val Glu Glu Ser He Tyr Gin Cy3 Of. . 



1345 



1950 



1655 



pro Glu Ala Arg Gin Ala He Lys Ser Leu 



1860 



1855 



Gly Gly pre Leu Thr Asn Ser Lya Gly Gin 



iThr Glu Axg Leu Tyr :ie 
1870 

Aan Cys Gly Tyr Arg Arg 
cv. Al.. ser aly V.l Leu Thr T^^ S-r Cys Cly Asn Thr .ea T»u^ 

• i 

' ' . m, se- Ala Ala CYS Arg' Ala Ala iys L«u Gin Asp 

eye Tyr Leu Lya ftia se. Ala iu-a ^.y » . ^^^^ 

1910 191S 
c'rT^- -c Leu Val A3n Gly Asp A.p, ..u': val Vai Ue Olu S.r 

;aa Gly T^_- cm Clu A=P Ala Ala Se. Leu. Arg Val P^e ^hr ^lu Ala 



1940 



1945 I 



„.c TV. S.X pro p.. «iy ^ P- 

1960 ! 

Leu lie ser C.s Ser Ser As. Val Ser Val Ala Hi, 
1S75 i 
;.s,Ta ser Gly Ly. Ar/val T^^ Tyr Leu Tbr Arg A=p Pro inr 

1,005 2000 

. Trl'Leu Ala Are -la Trp Glu T.-^' XlJ Arg V-i. Tl-.- .ro Val Asn 

2Q15 



2005 



2010 



ser rrp Leu Gly A.. He He Met Tyr All Pro Thr Leu Trp Ala Arg 

203'0 



2020 



202s 



2035 



2040 



2045 



Leu Glu Ly» Al* L«u Asp Cys Gin He Tyr Gly Ala Cy. Tyr Ser tie 
•20SO 



Clu pro r^u ^ pro GXn He IX. Gl. Arg Leu Sis Gly Leu Ser 

— — 2080 



2075 



206S 2070 
Ma .he ser Leu Hi, Ser Tyr Ser Pro Gly Glu Xla Asa Arg Val Ala 

^090 2095 
ser Cv, L.U Arg Lys Leu Gly Val Pro Pro Leu Arg Val Trp Arg Hia 

2110 



2100 



2105 
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Leu Lr Gin Sly Gly Arg Ala 



Arg 



2115 

Ala Thr cya Gly Lys ryr Leu 

2130 2135 
Lys Leu 
2145 



2120 



212S 



Phe ASH Trp Ala Val Lya Thr Lys Leu 
2140 

Xhr pro He Pro Ala Ala Ser Arg' Uu Asp Leu Ser Gly Trp 



2150 



2155 



2160 



val Ala Gly Tyr Ser Gly Gly Asp He 



2163 



2170 



;aa Arg Pro Arg Trp Ph« Met Leu cys Leu 

2135 



Tyr His Ser Leu Ser Arg 

2175 

Leu Leu Leu Ser Val Gly 
2130 



2180 

Val Gly He Tyr LftU 
2195 



Leu Pro ASH Arg 
2200 



(3) 
(i) 



iNrO^MAZIONI PEH SEQ ID NX : 3 j • 

CA?A7TERISTICHE DELLA SSQUE^^^A: i 

(A) LUNGKS22LA: 26 nuclaocidi i 

(B) TIPO: acido nucleica 
(c) nume:?.o DI CATENS: singola I 
(D) cCNrlGUJ^lOiTE:: linearft i 

TIPO DI MOLZCOLA: DNA sincatico | 
(iii) I?OTETICA: NO ; 
(•?v) AjrriSSNSO: IJo i 
(vii) 70KZE IMMEDIATA: sinC.tizzacora" di cligcnuclectidi 

{LJ<^ CA2ATTERISTICSH:: 

(A) NC>E: oligo a \ 

(C) METODO DI IDr:NTZFXCA2I0KE::gel di pcliacrilanuz^de 
(Xi) DISG^XZIONE DSLUV.SEQUKNZA: S3Q ip NO: 3 



(ii) 



GCCSAfiATGC CATCTTCAAA CAGTTC 



2S 



(4) INTORWAZIONI SULLA SEQOTl?2A; SZQ M NO; 4 
(1) CA2ATTERISTICHE DELIA SEQU2N1A; 

(A) LUNG2S22AI .24 nucleocidl 

(B) TIPO: acido nucleico 

(C) OTMERO DI.CATEN2: singola 

(D) CONflGTJRAZIONB: lineare 

(ii) T1?0 DI MDLSCOIA: WX aicicetico 

(iii) IPOTETICA: No 

(^•v) ANTISSNSO: No ^ 
(vli) FONTS IMMEDIATA: »iac.tiz«Mr. di olicoaucleocidi 

(ix) CAHATT2RISTICSE: 



(3) INFORMATION FOR SEQ ID NO : 3: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 26 nucleotides 

(B) TYPE: nucleic acid 

(C) NUMBER OF CHAINS: single 

(D) CONFIGURATION: linear 

(ii) MOLECULE TYPE: synthetic DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI SENSE: No 

(vii) IMMEDIATE SOURCE: oligonucleotide synthesizer 
(ix) CHARACTERISTICS: 

(A) NAME: oligo a 

(C) IDENTIFICATION METHOD: Polyacryl amide gel 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 
GCCGAGATGC CATCTTCAAA CAGTTC 26 

(4) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 24 nucleotides 

(B) TYPE: nucleic acid 

(C) NUMBER OF CHAINS: single 

(D) CONFIGURATION: linear 

(ii) MOLECULE TYPE: synthetic DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTISENSE: No 

(vii) IMMEDIATE SOURCE: oligonucleotide synthesizer 
(ix) CHARACTERISTICS: 

(A) NAME: oligo b 

(C) IDENTIFICATION METHOD: Polyacrylatnide gel. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 
GTGTACAACA AGGTCCATAT CACC 24 

(5) INFORMATION FOR SEQ ID NO: 5: ' ■ 



(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 24 nucleotides 

(B) TYPE: nucleic acid 

(C) NUMBER OF CHAINS: single 

(D) CHARACTERISTICS: linear 

(ii) MOLECULE TYPE: synthetic DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI SENSE: No 

(vii) IMMEDIATE SOURCE: oligonucleotide synthesizer 
( ix) CHARACTERI STICS : 

(A) NAME: oligo C 

(C) IDENTIFICATION METHOD: Polyacrylamide gel 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 
GGTCTTTCTG AACGGGATAT AAAC 24 

(6) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 31 nucleotides 

(B) TYPE: nucleic acid 

(C) NUMBER OF CHAINS: single 

(D) CONFIGURATION: linear 

(ii) ' MOLECULE TYPE: synthetic DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTISENSE: No 

(vii), IMMEDIATE SOURCE: oligonucleotide synthesizer 
(ix) • CHARACTERISTICS: 

(A) NAME: 5'-5B 

(C) IDENTIFICATION METHOD: Polyacrylamide gel 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 
AAGGATCCAT GTCAATGTCC TACACATGGA C 31 



(7) 



INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS 



(A) LENGTH: 3 6 nucleotides 

(B) TYPE: nucleic acid 

(C) NUMBER OF CHAINS: single 

(D) CONFIGURATION: linear 

(ii) MOLECULE TYPE: synthetic DNA 

(iii) HYPOTHETICAL: No 

(iv) ■ ANTI SENSE: No 

(vii) IMMEDIATE SOURCE: oligonucleotide synthesizer 
(ix) CHARACTERISTICS: 

(A) NAME: 3'-5B 

(C) IDENTIFICATION METHOD: Polyacrylamide gel 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 
AATATTCGAA TTCATCGGTT GGGGAGCAGG TAGATG 36 

(8) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 22 nucleotides 

(B) TYPE: nucleic acid 

(C) NUMBER OF CHAINS: single 

(D) CHARACTERISTICS: linear 

(ii) MOLECULE TYPE: synthetic DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTISENSE: No 

(vii) IMMEDIATE SOURCE: oligonucleotide synthesizer 
(ix) CHARACTERISTICS: 

(A) NAME: Dprl 

(C) IDENTIFICATION METHOD: Polyacrylamide gel 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 
TGGCTGGCAA GGCACACAGG CT 22 



(9) 



INFORMATION FOR SEQ ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 20 nucleotides 



(B) TYPE: nucleic acid 

(C) NUMBER OF CHAINS: single 

(D) CONFIGURATION: linear 

(ii) MOLECULE TYPE: synthetic DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTISENSE: No 

(vii) IMMEDIATE SOURCE: oligonucleotide synthesizer 
( ix) CONFIGURATION : 

(A) NAME : Dpr2 

(C) IDENTIFICATION METHOD: Polyacryl amide gel 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 
AGGCAGGGTA GATCTATGTC 20 

(10) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 20 nucleotides 

(B) TYPE: nucleic acid 

(C) NUMBER OF CHAINS: single 

(D) CONFIGURATION: linear 

(ii) MOLECULE TYPE: synthetic DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTISENSE: No 

(vii) IMMEDIATE SOURCE: oligonucleotide synthesizer 
(ix) CHARACTERISTICS: 

(A) NAME: NS5B-5' (1) 

(C) IDENTIFICATION METHOD: Polyacryl amide gel 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
TCAATGTCCT ACACATGGAC 20 



(11) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 38 nucleotides 

(B) TYPE: nucleic acid 



(C) NUMBER OF CHAINS: single 

(D) CONFIGURATION: linear 

(ii) MOLECULE TYPE: synthetic DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI SENSE: No 

(vii) IMMEDIATE SOURCE: oligonucleotide synthesizer 
(ix) CHARACTERISTICS: 

(A) NAME: HCVA-13 

(C) IDENTIFICATION METHOD: Polyacrylamide gel 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
GATCTCTAGA TCATCGGTTG GGGGAGGAGG TAGATGCC 38 

(12) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 399 nucleotides 

(B) TYPE: nucleic acid 

(C) NUMBER OF CHAINS: single 

(D) CONFIGURATION: linear 

(ii) MOLECULE TYPE: mRNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI SENSE: No 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rattus Norvegicus 
(C) STRAIN: Sprague-Dawley 

(vii) IMMEDIATE SOURCE: pT7-7 (DCoH) . 
(ix) CHARACTERISTICS: 

(A) NAME: D-RNA 

(C) IDENTIFICATION METHOD: Polyacrylamide gel 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 



CLAIMS 



1. Method for reproducing in vitro the RNA-dependent RNA 
polymerase activity or the terminal nucleotidyl transferase 
activity encoded by hepatitis C virus (HCV) , wherein sequences 
containing NS5B (SEQ ID NO: 1) are used in the reaction mixture. 

2. Method for reproducing in vitro the RNA-dependent RNA 
polymerase activity encoded by HCV according to Claim 1, wherein 
NS5B is incorporated in the reaction mixture as NS2-NS5B 
precursor, said precursor generating, by means of multiple 
proteolytic events that occur in the overproducing organism, an 
enzymatically active form of NS5B. 

3. Method for reproducing in vitro the terminal 
nucleotidyl transferase activity coded for HCV according to 
Claim 1, wherein NS5B is incorporated in the reaction mixture as 
NS2-NS5B precursor, said precursor generating, by means of 
multiple proteolytic events that occur in the overproducing 
organism, an enzymatically active form of NS5B. 

4. Composition of matter containing NS5B sequences 
according to Claims 1 through 3 . 

5. Composition of matter according to. Claim 4, comprising 
the proteins whose sequences are described in SEQ ID NO: 1, in 
sequences contained therein or derived therefrom. 

6. Use of the compositions of matter according to Claims 4 
and 5 to set up an enzymatic test capable of selecting, for 
therapeutic purposes, compounds that inhibit the enzymatic 
activity associated with NS5B. 



7. Method for reproducing in vitro the RNA-dependent RNA 
polymerase and terminal nucleotidyl transferase activities, 
compositions of matter and use of said compositions of matter to 
set up an enzymatic test capable of selecting, for therapeutic 
purposes, compounds that inhibit the enzymatic activities 
associated with NS5B, according to the above description, 
examples and claims. 
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[Key] 

1. Recombination sequence 

P ETL = Promoter of the gene coding for PCNA protei 

P PH = Promoter of the polyhedrin gene 

Amp = Gene coding for the S- lactamase enzyme 

(ampicillin resistance) 
LacZ (S-gal) = Gene coding for the S-galactosi.dase 

enzyme 

Col El = pBR322 replication origin 

Figure 1 
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Hiflcn 



010 =:= Bacteriophage T7 010 promoter 

rbs Shine -Dalgarno ribosome binding site 

ATG = Translation initiation site of the protein 

coded by the bacteriophage T7 gene 10 
g-lactamase = gene coding for the g-lactamase enzyme 

(ampicillin resistance) 
Col El = pBR322. replication origin 



Figure 2 



MINISTRY OF INDUSTRY, COMMERCE AND ARTISANRY 
GENERAL ADMINISTRATION OF PRODUCTIVE DEVELOPMENT AND 

COMPETITIVENESS 
ITALIAN PATENT AND TRADEMARK OFFICE 

PATENT FOR INDUSTRIAL INVENTION 

No. 01278077 

This patent was granted for the invention that was the 
subject of the application below: 



Applica- 
tion No. 


year 


U. P. I.e. A. 


Date of 
Presenta- 
tion of 
Applica- 
tion 


Classifi - 
cation 


000343 


• 95 


ROME 


5/25/1995 


C12Q 


HC 


DLDER : 


ISTITUTO DI 
MOLECOLARE 
IN POMEZIA 


RICERCHE DI 
S.p.A. 
(ROME) 


BIOLOGIA 



AGENT : 
ADDRESS : 

TITLE : 



INVENTOR ; 



DE BENEDETTI, FABRIZIO ' 

SOCIETA ITALIANA BREVETTI SPA 
P.ZA DI PIETRA 39 
0.0100 ROME 

METHOD FOR REPRODUCING IN VITRO THE RNA- 
DEPENDENT RNA POLYMERASE AND TERMINAL 
NUCLEOTIDYL TRANSFERASE ACTIVITIES 
ENCODED BY HEPATITIS C VIRUS (HCV) 

DE FRANCESCO, RAFFAELE 
BEHRENS, SVEN ERIK 
TOMEI LICIA 



[seal] 

Ministry of Industry, 
Commerce and Artisanry 
Italian Patent and 

Trademark Office 
[canceled tax stamp] 



Rome, NOVEMBER 17, 1997 



DIRECTOR OF DIVISION V 



signed: GIOVANNA MORELLI 

CERTIFIED TRUE COPY OF THE ORIGINAL 
[Illegible signature] 

Issued on December 3, 1997 
The Director of the UPICA 
[Illegible signature] , 



